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Abstract — The activity patterns of liiglily developed cognitive 
systems lilce tlie Iiuman brain are dominated by autonomous 
dynamical processes, that is by a self-sustained activity which 
would be present even in the absence of external sensory stimuli. 

During normal operation the continuous influx of external 
stimuli could therefore be completely unrelated to the patterns 
generated internally by the autonomous dynamical process. 
Learning of spurious correlations between external stimuli and 
autonomously generated internal activity states needs therefore 
to be avoided. 

We study this problem within the paradigm of transient state 
dynamics for the internal activity, that is for an autonomous 
activity characterized by a infinite time-series of transiently 
stable attractor states. We propose that external stimuli will 
be relevant during the sensitive periods, the transition period 
between one transient state and the subsequent semi-stable 
attractor. A diffusive learning signal is generated unsupervised 
whenever the stimulus influences the internal dynamics quali- 
tatively. 

For testing we have presented to the model system stimuli 
corresponding to the bar-stripes problem and found it capable 
to perform the required independent-component analysis on its 
own, aU the time being continuously and autonomously active. 

I. INTRODUCTION 

It is well known that the brain has a highly developed 
and complex self-generated dynamical neural activity. We 
are therefore confronted with a dichotomy when attempting 
to understand the overall functioning of the brain or when 
designing an artificial cognitive system: A highly developed 
cognitive system, such as the brain, is influenced by sensory 
input but is not driven directly by the input signals. The 
cognitive system needs however this sensory information 
vitally for adapting to a changing environment and survival. 
In this context we then want to address a two-fold goal: 
> Can we formulate a meaningful paradigm for the self- 
sustained internal dynamics of an autonomous cognitive 
system? 

• How is the internal activity process influenced by 
sensory signals, viz which are the principles for the 
respective learning processes? 
With respect to the first problem we note that an increasing 
flux of experimental results from neurobiology supports the 
notion of quasi-stationary spontaneous neural activity in the 
cortex [1], [2], [3], [4], [5]. It is therefore reasonable to 
investigate possible set-ups for artificial cognitive systems 
centrally based on the notion of internally generated sponta- 
neous transient states, as we do in the present investigation. 




Fig. 1. Schematic illustration of general or fluctuating activity patterns 
(top) and of transient-state dynamics (bottom), which is characterized by 
typical time scales ttrans and At for the length of activity-plateau and of 
the transient period respectively. 



II. GENERAL OUTLINE 

We will present the model and the issues being studied 
in two steps, starting with a general overview, discussing 
technical details in a second step. 

A. Transient-state and competitive dynamics 

There are many notions of complex system theory char- 
acterizing the long-term behavior of a dynamical system [6], 
such as regular vs. chaotic behavior The term 'transient-state 
dynamics' refers, on the other hand, to the type of activity 
occurring on intermediate time scales, as illustrated in Fig.[T| 
A time series of semi-stable activity patterns, also denoted 
transient attractors, is characterized by two time scales. The 
typical duration ttrans of the activity plateaus and the typical 
time At needed to perform the transition from one semi- 
stable state to the subsequent one. The transient attractors 
turns into stable attractors in the limit ttrans/ At — ^ oo. 

Transient state dynamics is intrinsically competitive in 
nature. When the current transient attractor turns unstable the 
subsequent transient state is selected by a competitive pro- 
cess. Transient-state dynamics is a form of 'multi-winners- 
take-air process, with the winning coalition of dynamical 
variables suppressing all other competing activities. 

Similar processes have been proposed to be relevant for 
various neural functionalities. Edelman and Tononi [7], [8] 
argue that 'critical reentrant events' constitute transient con- 
scious states in the human brain. These 'states-of-mind' are 
in their view semi-stable global activity states of a contin- 
uously changing ensemble of neurons, the 'dynamic core'. 
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Fig. 2. The growth rates ri(t) generating an internal transient state- 
dynamics via Eq. Q. The two examples differ in the functional dependence 
of the Tiit) on the Xjit). The top graph corresponds to a system having 
sensitive periods, the bottom graph to a system without distinctive sensitive 
periods. 



This activity takes place in what Dehaene and Naccache [9] 
denote the 'global workspace'. The global workspace serves, 
in the view of Baars and Franklin [10], as an exchange 
platform for conscious experience and working memory. 
Crick and Koch [11] and Koch [12] have suggested that 
the global workspace is made-up of 'essential nodes', i.e. 
ensembles of neurons responsible for the explicit represen- 
tation of particular aspects of visual scenes or other sensory 
information. 

B. Competitive dynamics and sensitive periods 

Within the setting of a generalized neural network we 
utilize a continuous-time formulation with rate-encoding 
neural activity centers, characterized by normalized activity 
levels Xi e [0, 1]. One can then define, quite generally, via 



(1 - Xi) n in > 0) 
Xi n {ri < 0) 



(1) 



the respective growth rates r^. Representative time series of 
growth rates are illustrated in Fig.|2] When the > the 
respective neural activity Xi increases, approaching rapidly 
the upper bound, as illustrated in Fig. [U when < 
it decays to zero. The model is specified [13], [14], by 
providing the functional dependence of the growth rates with 
respect to the set of activity-states {xj}. 

During the transition periods many, if not all, neurons 
will enter the competition to become a member of the new 
winning coalition. The competition is especially pronounced 
whenever most of the growth rates are small in magnitude, 
with no small subset of growth rates dominating the all the 
others. Whether this does or does not happen depends on 
the specifics of the model set-up, in Fig. |2] two cases are 
illustrated (upper/lower graph in Fig. In the first case the 
competition for the next winning coalition is restricted to 
a subset of neurons, in the second case the competition is 
network- wide. 

When most neurons participate in the competition process 
for a new winning coalition the model will have 'sensitive 
periods' during the transition times and it will able to react 
to eventual external signals. 




Fig. 3. Result for the activities Xi(t), the growth rates ri{t) and the input 
signals Ari{t) from a simulation of a system containing N = 15 sites 
(color coding). The time series of winning coalition is (from left to right): 
(1,5) (2,9,12) (0,2,3) ^ (8,10) if^ (5,14) ^ (4,9,12) ^ 
(12,13), where [..] corresponds to the transition-label given in the graph. 



C. Sensitive periods and learning 

So far we have discussed in general terms the properties 
of isolated models exhibiting a self-sustained dynamical 
behavior in terms of a never-ending time series of semi-stable 
transient states, as illustrated in Fig. 121 using rate-encoding 
equations specified in previous work [13], [14]. 

The importance of sensitive periods comes when this 
model is coupled to a stream of sensory input signals. It 
is reasonable to assume, that external input signals will 
contribute to the growth rates via 

n^n + Ari, (2) 

where the Ar^ encode the influence of the input signals. 
Let us furthermore assume that the input signals are suitably 
normalized, such that 



Ar,, 



0.5 (active input) 
(inactive input) 



(3) 



For the transient states the « — 1 for all sites not forming 
part of the winning coalition and the input signal will there- 
fore not destroy the transient state. With the normalization 
given by Eq. Q the total growth rate + Ar^ will remain 
negative for all inactive sites. The input signal will however 
enter into the competition for the next winning coalition 
during a sensitive period, providing an additional boost for 
the respective neurons. 

This situation is exemplified in Fig. [51 where we present 
simulation-results for a system containing iV = 15 neurons 
subject to two sensory inputs Ari(i). The self-generated 
time series of winning coalitions is both times redirected 
by the strongest component of the sensory input, the details 
depending on whether the input signal partially overlaps with 
the current winning coalition or not. 

We then have a model system in which there are well de- 
fined time-windows suitable for the learning of correlations 
between the input signal and the intrinsic dynamical activity, 
namely during and shortly after a transition period, that is 



the sensitive period. A possible concrete implementation for 
this type of learning algorithm wiU be given further below. 

This set-up would allow the system also to react to an 
occasional very strong input signal having A?-; > 1. Such 
a strong signal would suppress the current transient state 
altogether and impose itself. This possibility of rare strong 
input signals is evidently important for animals and would 
be, presumably, also helpful for an artificial cognitive system. 

D. Dijfusive learning signals 

Let us return to the central problem inherent to all system 
reacting to input signals having at the same time a non-trivial 
intrinsic dynamical activity. Namely when should learning 
occur, i.e. when should a distinct activity center become 
more sensitive to a specific input pattern and when should it 
suppress its sensibility to a sensory signal. 

The above developed framework of competitive dynamics 
allows for a straightforward solution of this central issue: 
Learning should occur then and only then when the input 
signal makes a qualitative difference, viz when the input 
signal deviates the transient-state process. For illustration let 
us assume that the series of winning coaUtions is 

(1,5) ^ (2,9,12) ^ (0,2,3) , 

where the index [a] indicates that the transition is driven 
by the internal autonomous dynamics and that the series of 
winning coalitions take the form 

(1,5) ^ (2,9,12) ^ (0,4,10) 

in the presence of a certain sensory signal [s\. Note, that a 
background of weak or noisy sensory input could be present 
in the first case, but learning should nevertheless occur only 
in the second case. 

Technically this is achieved by defining a suitable diffusive 
learning signad Sit). It is activated whenever any of the 
input signals Ar^ changes the sign of the respective growth 
rates during the sensitive periods, 

g ^ [ ^Iff in > 0) and (r, - An < 0) 
I -^Mff otherwise 

viz when it makes a qualitative difference. Here (r^ — Ar^) 
is the internal contribution to the growth rate, i.e. the value 
it would have in the absence of the input signal Ar; and the 
^d!iff > are the growth and decay rates for the diffusive 
learning signal respectively. The diffusive learning signal 
S{t) is a global signal and a sum over all dynamical 
variables is therefore implicit on the right-hand side of 
Eq. dUi. 

'The name 'diffusive learning signal' [6] stems from the fact, that many 
neuromodulators are released in the brain in the intercellular medium and 
then diffuse physically to the surrounding neurons, influencing the behavior 
of large neural assemblies. 
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Fig. 4. Schematic representation of the information flow from a raw 
pattern (bottom) via the input layer (middle) to the dHan layer (top). 
The synaptic strength v^^ connecting the input with the dHan layer (denote 
'v' in the graph) are adapted during the learning process (illustrated 
selectively by respective thick/thin blue lines in the graph). 
The dHan layer consists of active and inactive neurons (red/yellow circles) 
connected by intra-layer synaptic weights (denote 'w' in the graph). The 
topology shows five cliques (denoted C-I to C-V in the graph) of which 
C-II is active, as emphasized by the red-color neurons. 

E. The role of attention 

The general procedure for the learning of correlation 
between external signals and intrinsic dynamical state for 
a cognitive system presented here does not rule out other 
mechanisms. Here we concentrate on the learning algorithm 
which occurs automatically, one could say sub-consciously. 
Active attention focusing, which is well known in the brain 
to potentially shut off a sensory input pathway or to enhance 
sensibility to it, may very well work in parallel, to the 
continuously ongoing mechanism investigated here. 

III. ASSOCIATIVE THOUGHT PROCESSES 

So far we have described, in general terms, the system 
we are investigating, having sensitive periods during the 
transition periods of the continuously ongoing transient- 
state process, with learning of input signals regulated by a 
diffusive learning signal. 

Our system consists of two components, as illustrated 
in Fig. m For the component generating an infinite time- 
series of transient state we employ a dense homogeneous 
associative network (dHAN). The dHAN-model has been 
studied previously and shown to generated a time series of 
transient states characterized by high associative overlaps 
between subsequent winning coalitions [13], [14]. The time 
series might therefore be interpreted as an associative thought 
process, it carries with it a dynamical attention field [13]. 

A. Input data-stream analysis 

The input signal acts via Eq. (|2]l on the dHAN layer, with 
the contribution Ar^ to the growth rate of the dHAN neuron 
i given by 

3 j 

where we have denoted now with the superscripts p/q the 
dHAN- and input-layer respectively. For subsequent use we 



also define in Eq. ^ an auxiliary variable As^ which 
quantifies the influence of inactive input-neurons. The task is 
now to find a suitable learning algorithm which extracts the 
relevant information from the input-data stream by mapping 
distinct input-patterns onto selected winning coalitions of the 
dHAN layer This is the setup typical for an independent 
component analysis [15]. 

The multi-winners-take-all dynamics in the dHAN module 
implies individual neural activities to be close to 0/1 during 
the transient states and we can therefore define three types 
of inter-layer links vfj (see Fig. H]): 

• active ('act') 

Links connecting active input neurons with the winning 
coalition of the dHAN module. 

> orthogonal ('orth') 

Links connecting inactive input neurons with the win- 
ning coalition of the dHAN module. 

> inactive ( 'ina ') 

Links connecting active input neurons with inactive 
neurons of the dHAN module. 

The orthogonal links take their name from the circumstance 
that the receptive fields of the winning coalition of the target 
layer need to orthogonalize to all input-patters differing from 
the present one. Note that it is not the receptive field of 
individual dHAN-neurons which is relevant, but rather the 
cumulative receptive field of a given winning coalition. 

We can then formulate three simple rules for the respective 
link-plasticity. Whenever the new winning coalition in the 
dHAN-layer is activated by the input layer, viz whenever 
there is a substantial diffusive learning signal, i.e. when S^^jj: 
exceeds a certain threshold S^i^f, the following optimization 
procedures should take place: 

« active links 

The sum over active links should take a large but finite 
value r.T*, 
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The sum over orthogonal links should take a small value 
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The sum over inactive links should take a small but 
non-vanishing value rj/^*^. 
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Fig. 5. Examples of typical input patterns for a 5 X 5 bars problem with 
a probability p = 0.1 for the occurrence of the individual horizontal or 
vertical bars. The problem is non-linear since the pattern intensity is not 
enhanced when an elementary horizontal and vertical bar overlap. 



to the link plasticities: 

= r^^* e(xf - xl"^) Sign(r2'=* - 

= r™" e(xj,"'^ - ) Sign(C - 



„orth 



Arf) 
'Arf) 



(6) 



where the inputs Arf and Asf to the target layer are defined 
by Eq. Q. For the sign-function Sign(a;) = ±1 is valid for 
a; > and a; < respectively, 9{x) denotes the Heaviside- 
step function in Eq. (|6]l. The inter-layer links vfj cease to 
be modified whenever the total input is optimal, viz when no 
more 'mistakes' are made [16]. 

Using these definitions, the link plasticity may be written 

as 



+ C"' (1 - X 



(7) 



where S^ifj is an appropriate threshold for the diffusive 
learning signal r^"^* and r""^ are the desired contributions to 
the growth rate for active/inactive postsynaptic neurons, FJJ'^* 
and F*"" the respective rates and a;^'^* and a;*"" the critical 
activity reservoirs defining an active/inactive postsynaptic 
center respectively. A suitable set of parameters, which has 
been used throughout this work, is given in Table U 

We note, that a given interlayer-link vfj is in general 
subject to competitive optimization from the three processes 
(act/orth/ina). Averaging would occur if the respective learn- 
ing rates F^'^VF°''*'VF™'^ would be of the same order of 
magnitude. It is therefore necessary, that 



paci ^ j^orth 



B. Homeostatic normalization 



» r:; 



It is desirable that the interlayer connections vfj neither 
grow unbounded with time (runaway-effect) nor disappear 
into irrelevance. Suitable normalization procedures are there- 
fore normally included explicitly into the respective neural 
learning rules and are present implicitly in Eqs. (|6]l and 
Eq. O. 

The strength of the input-signal is optimized by Eq. ^ 
both for active as well as for inactive p-layer neurons, a 



TABLE I 

A POSSIBLE SET OF PARAMETERS FOR THE INTER-LAYER LINKS, WITH 



X inactive 



: 4.0 AND r 



dtff 



■ 0.08. 



and s"''*'* are the target values for the 
respective optimization processes. In order to implement 
these three rules we define three corresponding contributions 
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Fig. 6. The response, as defined by Eq. Js), for the 14 winning coalitions in the dHAN layer (compare Fig. |7) to the ten reference patterns, viz the 5 
horizontal bars and the 5 vertical bars of the 5x5 input field. 



property referred to as fan-in normalization. Eqs. (|6]l and 
O also regulate the overall strength of inter-layer links 
emanating from a given q-layer neuron, a property called 
fan-out normalization. 

Next we note, that the timescales for the intrinsic au- 
tonomous dynamics in the dHAN-layer and for the input sig- 
nal could in principle differ substantially. Potential interfer- 
ence problems can be avoided when learning is switched/one 
very fast, viz when activation and when the decay rate ^%ff 
are larger, i.e. when l/rj^yy^ is smaller than both the typical 
time scales of the input and of the self-sustained dynamics. 

IV. THE BARS PROBLEM 

A cognitive system needs to autonomously extract mean- 
ingful information about its environment from its sensory 
input data stream via signal separation and features extrac- 
tion. The identification of recurrently appearing patterns, i.e. 
of objects, in the background of fluctuation combinations of 
distinct and noisy patterns constitutes a core demand in this 
context. This is the domain of the independent component 
analysis [15] and blind source separation [17], which seeks to 
find distinct representations of statistically independent input 
patterns. 

In order to test our system made-up by an input-layer 
coupled to a dHAN layer, as illustrated in Fig. |4] we have 
selected the bars problem [18]. The bars problem constitutes 
a standard non-linear reference task for the feature extraction 
via an independent component analysis for a L x L input 
layer. Basic patterns are the L vertical and L horizontal 
bars. The individual input patterns are made-up of a non- 
linear superposition of the 2L basic bars, containing with 
probability p — O.l any one of them, as shown in Fig. |5] 



A. Simulation results 

For the simulations we presented to the system about 
Npatt ~ 200 randomly generated 5x5 input patterns of 
the type shown in Fig. |5] including a small noise-level of 
about 5%. The individual patterns lasted Tpatt — 30 with 
about Tinter = 190 for the time in between two successive 
input signals. These time-scales are to be compared with the 
time-scale of the autonomous dHAN-dynamics illustrated in 
the Figs. |2] and [3] for which the typical stability-period for 
a transient state is about Urans ~ 70. This implies that we 
presented the system sparse (in time) sensory input. We also 
note that the number of training patterns in our simulation is 
exceedingly small, standard neural algorithms use routinely 
^ 10"* and more training patterns [19]. 

The results for the simulations are presented in Figs.|6]and 
|7] For the geometry of the dHAN network we used a simple 
15-site chain containing 14 potential winning coalitions, as 
illustrated in Fig. H namely (0, 1), (1, 2), . . . , (14, 15). 

In Fig. |6]we present the response 

^ V AA'' a = l,..,14 

5(C„) . ^ ' /3=1,..,10 ^""^ 

of the 14 potential winning coalitions A to the 10 basic 
input patterns {xj'^,j = 1,..,25}, the isolated bars. Here 
Ca denotes the set of sites of the winning-coalition a and 
5(A) its size, here ^(A) = 2. 

The individual potential winning coalitions have acquired 
in the course of the simulation, via the learning rule Eq. (|7]|, 
distinct susceptibilities to the 10 bars compare Fig. |6] We 
note that the learning is unsupervised and quite fast. This 
example-problem is overcomplete, there are more potential 
winning coalitions than statistically independent basic pat- 
tern. Perfect signal separation can therefore not be expected. 
E.g. the second-last vertical bar in Fig. |7]is not resolved. 
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Fig. 7. Top: The receptive fields, Eq. (9), for the 14 winning coalitions 
[1], .., [5] (first row) [6], .., [10] (second row) and [11], .., [14] (third row) 
as a function of the 5x5 input field, compai'e Fig. \5\ For the response of 
the 14 winning coalitions with respect to the ten reference input patterns 
compare Fig. |6] Note that the receptive fields can be both positive as well 
as negative, see the color-coding. 

Bottom: The geometry of the dHAN layer as a linear chain. The winning 
coalitions [i] (i = 1, ..,14) are numerated and correspond here to two 
connected nearest-neighbor in a linear-chain layout. 



As the dHAN dynamics is competitive in nature already 
relatively small differences in the response, Eq. dH), may 
determine the outcome of the competition between two 
competing potential winning coalitions. This circumstance 
is born-out by the results presented in Fig. |6l 



B. Receptive fields 
The receptive fields 

1 V 



S{C^) 



(9) 



of the 14 potential winning coalitions are given in Fig. [7] 
The inter-layer synaptic weights vfj can be both positive 
and negative and the orthogonalization procedure, Eq. (|6]), 
results in complex receptive fields. 

Physically separated target neurons, viz a ' single- winner- 
takes-air set-up, are normally used for standard analysis 
neural algorithms performing an independent component 
analysis [15]. The winning coalitions of the dHAN-layer are 
however overlapping and every link vfj targets in general 
more than one potential winning coalition in the dHAN layer, 
in our case two for i — 2, .., 14 (and one for i = 1, 15). 

The unsupervised learning procedure, Eq. (|7]i, involves 
therefore a competition between the active contribution cf^* 
and the c°''*'* and c™°, as given by Eq. The inter-layer 
synaptic weights are therefore only changed to the point 
necessary for recognition, but not up to the saturation point, 
as evident from the data presented in Fig. [T] This behavior 
is consistent with the 'learning by mistakes' paradigm [16], 
which states that a cognitive system needs to learn in general 
only when committing a mistake. 

V. CONCLUSIONS AND FUTURE WORKS 

A cognitive system has its own internal dynamics and 
we studied here the interplay of these self-generated activity 



states, the time-series of winning coalitions, with the sensory 
input for the purpose of unsupervised feature extraction. We 
proposed learning to be autonomously activated during the 
transition from one winning coalition to the subsequent one. 

This general principle may be implemented algorithmi- 
cally in various fashion. Here we used a generalized neural 
net (dHAN - dense homogeneous associative net) for the 
autonomous generation of a time series of associatively con- 
nected winning coalitions and controlled the unsupervised 
extraction of input-features by an autonomously generated 
diffusive learning signal. 

We tested the algorithm for the bars problem and found 
good and fast learning for the case of sparse temporal 
input. Preliminary results indicate that the learning algorithm 
retains functionality under a wide range of conditions. We 
plan to extend the simulations to various forms of temporal 
inputs, especially to quasi-continuous input and to natural 
scene analysis and to study the embedding of the here 
proposed concept within the framework of a full-fledged and 
autonomously active cognitive system. 
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